Truth discovery under resource constraints
نویسنده
چکیده
Social computing initiatives that mark a shift from personal computing towards computations involving collective action, are driving a dramatic evolution in modern decision-making. Decisionmakers or stakeholders can now tap into the power of tremendous numbers and varieties of information sources (crowds), capable of providing information for decisions that could impact individual or collective well-being. More information sources does not necessarily translate to better information quality, however. Social influence in online environments, for example, may bias collective opinions. In addition, querying information sources may be costly, in terms of energy, bandwidth, delay overheads, etc., in real-world applications. In this research, we propose a general approach for truth discovery in resource constrained environments, where there is uncertainty regarding the trustworthiness of sources. First, we present a model of diversity, which allows a decision-maker to form groups, made up of sources likely to provide similar reports. We demonstrate that this mechanism is able to identify different forms of dependencies among information sources, and hence has the potential to mitigate the risk of double-counting evidence due to correlated biases among information sources. Secondly, we present a sampling decision-making model, which combines source diversification and reinforcement learning to drive sampling strategy. We demonstrate that this mechanism is effective in guiding sampling decisions given different task constraints or information needs. We evaluate our model by comparing it with algorithms representing classes of existing approaches reported in the literature.
منابع مشابه
Strategies for Truth Discovery under Resource Constraints ( Extended Abstract )
We present a decision-theoretic approach for sampling information sources in resource-constrained environments, where there is uncertainty regarding source trustworthiness. We exploit diversity among sources to stratify the population into homogeneous subgroups to both minimise redundant sampling and mitigate the effect of source collusion. We show through empirical evaluation that our model is...
متن کاملStrategies for Truth Discovery under Resource Constraints
We present a decision-theory based approach for efficiently sampling information sources in resource-constrained environments, where there is uncertainty regarding source trustworthiness. We exploit diversity among sources to stratify the population into homogeneous subgroups to both minimise redundant sampling and mitigate the effect of certain biases (e.g., source collusion). After presenting...
متن کاملExploring Relevance as Truth Criterion on the Web and Classifying Claims in Belief Levels
The Web has become the most important information source for most of us. Unfortunately, there is no guarantee for the correctness of information on the Web. Moreover, different websites often provide conflicting information on a subject. Several truth discovery methods have been proposed for various scenarios, and they have been successfully applied in diverse application domains. In this paper...
متن کاملA novel mathematical model for a hybrid flow shop scheduling problem under buffer and resource limitations-A case study
Scheduling problems play a big role in manufacturing and planning the production for increasing the production efficiency and assigning the resources to operations. Furthermore, in many manufacturing systems there is a physical space between stages that called intermediate buffers. In this study, a model is proposed for minimizing the makespan of a hybrid flow shop scheduling problem with inter...
متن کاملSINGLE MACHINE DUE DATE ASSIGNMENT SCHEDULING PROBLEM WITH PRECEDENCE CONSTRAINTS AND CONTROLLABLE PROCESSING TIMES IN FUZZY ENVIRONMENT
In this paper, a due date assignment scheduling problem with precedence constraints and controllable processing times in uncertain environment is investigated, in which the basic processing time of each job is assumed to be the symmetric trapezoidal fuzzy number, and the linear resource consumption function is used.The objective is to minimize the crisp possibilistic mean (or expected) value of...
متن کامل